The accuracy of methods for coding and sampling higher-level taxa for phylogenetic analysis: a simulation study.

نویسنده

  • J J Wiens
چکیده

Many phylogenetic analyses, particularly morphological studies, use higher taxa (e.g., genera, families) rather than species as terminal taxa. This general approach requires dealing with interspecific variation among the species that make up the higher taxon. In this paper, I review different parsimony methods for coding and sampling higher taxa and compare their relative accuracies using computer simulations. Despite their widespread use, methods that involve coding higher taxa as terminals perform poorly in simulations, relative to splitting up the higher taxa and using species as terminals. Among the methods that use higher taxa as terminals, coding a taxon based on the most common condition among the included species (majority or modal coding) is generally more accurate than other coding methods, such as coding taxa as missing or polymorphic. The success of the majority method, and results of further simulations, suggest that in many cases "common equals primitive" within variable taxa, at least for low and intermediate rates of character change. The fixed-only method (excluding variable characters) performs very poorly, a result that is indirectly supported by analyses of published data for squamate reptiles. Sampling only a single species per higher taxon also yields low accuracy under many conditions. Along with recent studies of intraspecific polymorphism, the results of this study show the general importance of (1) including characters despite variation within taxa and (2) using methods that incorporate detailed information on the distribution of states within variable taxa.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The use and validity of composite taxa in phylogenetic analysis.

In phylogenetic analysis, one possible approach to minimize missing data in DNA supermatrices consists in sampling sequences from different species to obtain a complete sequence for all genes included in the study. We refer to those complete sequences as composite taxa because DNA sequences that are combined belong to different species. An alternative approach is to analyze incomplete supermatr...

متن کامل

Phylogenetic Analysis of Beta-Glucanase Producing Actinomycetes Strain TBG-CH22 - A Comparison of Conventional and Molecular Morphometric Approach

Actinomycetes are inexhaustible producers of commercially valuable metabolites, are continually screened for beneficial compounds. The taxonomic and phylogenetic study of novel actinomycetes strains are mostly based on conventional methods and primary DNA structure of 16s rRNA. Although 16s rRNA sequence is well accepted in phylogeny studies, its secondary structures have not been widely used. ...

متن کامل

A taxonomic study of cyanobacteria in wheat fields adjacent to industrial areas in Yazd province (Iran)

Culturing, isolation, purification, and identification of cyanobacteria collected from wheat field soil, in five stations around the industrial areas in Yazd province (Iran) were conducted in this study. Identification of taxa was based on morphology and molecular methods. Cluster analysis and principal component analyses performed using SPSS software and rate of resemblance among the taxa were...

متن کامل

Phylogenetic Analysis of Beta-Glucanase Producing Actinomycetes Strain TBG-CH22 - A Comparison of Conventional and Molecular Morphometric Approach

Actinomycetes are inexhaustible producers of commercially valuable metabolites, are continually screened for beneficial compounds. The taxonomic and phylogenetic study of novel actinomycetes strains are mostly based on conventional methods and primary DNA structure of 16s rRNA. Although 16s rRNA sequence is well accepted in phylogeny studies, its secondary structures have not been widely used. ...

متن کامل

Phylogenetic Analysis of Three Long Non-coding RNA Genes: AK082072, AK043754 and AK082467

Now, it is clear that protein is just one of the most functional products produced by the eukaryotic genome. Indeed, a major part of the human genome is transcribed to non-coding sequences than to the coding sequence of the protein. In this study, we selected three long non-coding RNAs namely AK082072, AK043754 and AK082467 which show brain expression and local region conservation among vertebr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Systematic biology

دوره 47 3  شماره 

صفحات  -

تاریخ انتشار 1998